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Abstract 

A system and method are 
described for automating 
navigations in systems with 
measurement data that are 
structured as a 
multidimensional database 
(MDDB). The navigation 
includes an Automated 
Navigation Engine that uses 
external representations of a 
dimension selection function 
and a cube selection function 
to guide MDDB navigations. 
The method consists of the 
steps: (1) select the MDDB 
dimension that has the 
largest dimension selection 
score; (2) if this score is too 
small, terminate the method; 
(3) otherwise, select the cube 
with the largest cube 
selection score. These steps 
are performed repeatedly, 
until (2) applies. Navigations 
can be performed for data 
mining, problem isolation, 
trending analysis, etc. 
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Claims 

What is claimed is: 

1. A system for obtaining information by 
automatically navigating measurement data that 
is structured into cubes comprising a plurality of 
dimensions, the system being associated with a 
multidimensional database component for 
specifying said cubes, said system comprising: 

(a) a dimension selection component that 
quantifies information provided by navigating 
to a dimension; 

(b) a cube selection component that 
quantifies the extent to which a cube provides 
information; and 

(c) an automated navigation engine that uses 
the multidimensional database component, 
the dimension selection component and the 
cube selection component to automate 
navigations by iteratively selecting the best 
cube within successively selected cube 
dimensions to arrive at the cube having the 
desired information. 

2. The system of claim 1 wherein said system is 
adapted to run at least one problem isolation 
application and wherein said navigations isolate 
problems. 

3. The system of claim 1 wherein said 
navigations comprise mining of data stored in 
said database. 

4. The system of claim 1 wherein said 
navigations comprise trending analysis. 

5. A method for a system to automatically 
navigate measurement data for information 
gathering, wherein said measurement data is 
structured into a plurality of cubes comprising a 
plurality of dimensions, said method comprising 
the steps of: 
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(a) characterizing the information potential of 
said data for each of said plurality of 
dimensions; 

(b) selecting the dimension having the 
greatest information potential based upon 
said characterizing; 

(c) comparing the dimension information 
potential of said selected dimension to a 
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preset threshold; 

(d) terminating the method if the dimension 
information potential for said selected 
dimension is less than said threshold; 

(e) determining the cube information potential 
for at least one of said plurality of cubes 
within said selected dimension when said 
dimension information potential is not less 
than said threshold; 

(f) selecting the cube within the selected 
dimension with the largest cube information 
potential, and 

(g) iteratively selecting the best cube within 
successively selected cube dimensions by 
repeating steps (b) through (f) to arrive. at the 
cube having the desired information. 

6. The method of claim 5 wherein said 
characterizing comprises quantifying the 
information potential. 

7. The method of claim 6 wherein said system 
has at least one associated dimension selection 
component for operating with a dimension 
selection function, and wherein said quantifying 
comprises computing a dimension selection 
score by said dimension selection function for 
each of said plurality of dimensions. 

8. The method of claim 6 wherein said system 
has at least one associated cube selection 
component for operating with a cube selection 
function, and wherein said selecting said cube 
comprises computing a cube selection score by 
said cube selection function. 

9. The method of claim 7 wherein said system 
further comprises at least one associated cube 
selection component for operating with a cube 
selection function, and wherein said selecting 
said cube comprises computing a cube 
selection score by said cube selection function. 

10. The method of claim 9 wherein at least one 
of said cube selection function and said 
dimension selection function are externally 
specified. 

1 1 . The method of claim 8 wherein said cube 
selection function is externally specified. 

12. The method of claim 7 wherein said 
dimension selection function is externally 
specified. 

13. A method for providing automated problem 
isolation for a computer system adapted for 
detecting a problem, for gathering relevant 
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problem data, and for providing said data in a 
cube, comprising the steps of: 

(a) obtaining an initial cube of relevant data 
from said computer system; 

(b) selecting a cube dimension that best 
isolates the problem; 

(c) comparing said selected cube dimension 
to a threshold; 

(d) finding the best cube in said selected 
cube dimension when said selected cube 
dimension is not less than said threshold; 

(e) repeating steps (b) through (d) until said 
selected cube dimension is less than said 
threshold. 

1 4. The method of claim 1 3 further comprising 
the step of obtaining a dimension score function 
to control the granularity of said problem 
isolation. 

1 5. The method of claim 1 3 wherein said finding 
the best cube in said selected cube dimension 
comprises the steps of: 

(a) selecting a present dimension score; 

(b) identifying a successive dimension in said 
cube; 

(c) computing a successive dimension score 
for said successive dimension; 

(d) comparing said successive dimension 
score to said present dimension score; 

(e) replacing said present dimension score 
with said successive score when said 
successive score better isolates the problem; 

(f) determining whether all dimensions have 
been analyzed; and 

(g) repeating steps (b) through (f) until all 
dimensions have been analyzed. 

16. The method of claim 15 further comprising 
applying at least one of a cube score function 
and a dimension score function for computing 
scores. 

17. The method of claim 16 wherein at least 
one of said cube score function and said 
dimension score function is externally specified. 



18. A program storage device readable by 
machine, tangibly embodying a program of 
instructions executable by the machine to 
perform method steps for a system associated 
with said machine to navigate measurement 
data for information gathering, wherein said 
measurement data is structured into a plurality 
of cubes comprising a plurality of dimensions, 
said method steps comprising: 



http://vvw.googlexom/patents?vid-USPAT6330564&id=ANMlAAAAEBAJ&dq=metric... I /l 2/2007 



System and method for automated problem... - Google Patents 



Page 4 of 5 



(a) characterizing the information potential of 
said data for each of said plurality of 
dimensions; 

(b) selecting the dimension having the 
greatest information potential based upon 
said characterizing; 

(c) comparing the dimension information 
potential of said selected dimension to a 
preset threshold; 

(d) terminating the method if the dimension 
information potential for said selected 
dimension is less than said threshold; 

(e) determining the cube information potential 
for at least one of said plurality of cubes 
within said selected dimension when said 
dimension information potential is not less 
than said threshold; 

(f) selecting the cube within the selected 
dimension with the largest cube information 
potential, and 

(g) iteratively selectinq the best cube within 
successively selected cube dimensions by 
repeating steps (b) through (f) to arrive at the 
cube having the desired information. 

1 9. The device of claim 1 8 wherein said 
characterizing comprises quantifying the 
information potential. 

20. The device of claim 19 wherein said system 
has at least one associated dimension selection 
component for operating with a dimension 
selection function, and wherein said quantifying 
comprises computing a dimension selection 
score by said dimension selection function for 
each of said plurality of dimensions. 

21. The device of claim 19 wherein said system 
has at least one associated cube selection 
component for operating with a cube selection 
function, and wherein said selecting said cube 
comprises computing a cube selection score by 
said cube selection function. 

22. The device of claim 20 wherein said system 
further comprises at least one associated cube 
selection component for operating with a cube 
selection function, and wherein said selecting 
said cube comprises computing a cube 
selection score by said cube selection function. 

23. The device of claim 22 wherein at least one 
of said cube selection function and said 
dimension selection function are externally 
specified. 

24. The device of claim 21 wherein said cube 
selection function is externally specified. 
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25. The device of claim 20 wherein said 
dimension selection function is externally 
specified. 
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Abstract 

A novel method of and system 
and procedures for more 
accurately measuring the 
resource usage of UNIX 
processes by sampling 
methods involving appropriate 
corrections for the resource 
usage of the terminated 
processes and analyzing UNIX 
process data along with 
subsystem data such as 
RDBMSs, allowing system 
administrators and mangers to 
get a much better picture of 
who is using the resources on 
the system and thus perform a 
better job at performance 
analysis and capacity planning, 
the technique also enabling and 
reducing the error in the 
process measurements 
collected by sampling of the 
resources usage measured by 
the operating system and 
correlating the measurements 
taken by subsystems with the 
measurements taken by the 
operating system. 
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Claims 

What is claimed is: 

1 . A method of reducing the errors in the 
measurements of the usage of resources 

such as CPU's by computer system 
processes, for such purposes as 
performance analysis and planning, that 
comprises, measuring the resource usage 
by the operating system processes of the 
computer system by periodically sampling 
the CPU(s) to determine whether idle or 
apparently busy, and if busy, with which 
process; correction the measurement of 
resource usage of terminated processes; 
measuring the resource usage by one or 
more process-implemented subsystems of 
the computer system by periodic sampling; 
and correlating the measurements taken 
by the subsystems with those taken by the 
operating system as corrected. 

2. A method as claimed in claim 1 and in 
which, during the collection of resource 
usage data, enhancing data collection by 
recreating data lost between samples. 

3. A method as claimed in claim 2 and in 
which, during each sampling period, the 
resource usage data is collected from the 
operating system for each process; and in 
the event that a process that was running 
during a previous sampling period is 
observed no longer to be running, 
correspondingly to modify the usage data in 
accordance with earlier usage data. 

4. A method as claimed in claim 3 and in 
which the computer system is implemented 
as a UNIX system wherein every process has 
a parent process from which child processes 
are forked in a tree-like hierarchy, and in 
which subsystems are implemented by sets 
of processes. 

5. A method as claimed in claim 4 and in 
which, whenever it is observed in a sampling 
period that a process that was running during 
the previous sampling period is terminated, 
modifying the resource usage in accordance 
with the usage recorded by the parent 
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process. 

6. A method as claimed in claim 5 and in 
which detailed process data is converted into 
resource usage statistics correlated with 
appropriate workloads and user entities, 
corrected by process capture ratios. 

7. A system for reducing the errors in the 
measurements of the usage of resources by 
computer system processes, having, in 
combination with the computer system, 
periodic sampling measurement means for 
measuring the resource usage by the 
computer operating system of the resources; 
means for correcting the measuring of 
resource used by takings into account 
terminated processes; and means for 
analyzing the resulting data to provide an 
improved picture of resource utilization. 

8. A system as claimed in claim 7 and in 
which there is provided means for measuring 
by periodic sampling the resource usage by 
one or more process-implemented 
subsystems of the computer system; and 
means for correlating the measurements 
taken by the subsystems with those taken by 
the operating system. 

9. A system as claimed in claim 8 and in 
which means is provided, operable during the 
collection of resource usage data, for 
enhancing data collection by recreating data 
lost between samples. 

10. A system as claimed in claim 9 and in 
which means is provided for modifying the 
usage data to account for the terminating of 
resource usage by a process in a sampling 
period where that process was operating in a 
previous sampling period. 

1 1 . A system as claimed in claim 9 and in 
which the computer system operates as a 
UNIX system with every process having a 
parent process from which are forked child 
processes in a tree-like hierarchy, and in 
which subsystems are implemented by sets 
of processes. 

1 2. A system as claimed in claim 1 1 and in 
which, means is provided, operable in the 
event that a previously running process is 
indicated as terminated in a sampling period, 
for modifying the resource usage 
measurement in accordance with the usage 
indicated by the parent process. 
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13. A system as claimed in claim 12 and in 
which means is provided for converting 
detailed process data into resource usage 
statistics correlated with appropriate 
workloads and user entities, corrected by 
process capture ratio. 

14. A system as claimed in claim 12 and in 
which the subsystem is an ORACLE system 
implemented in the UNIX system. 

15. A system as claimed in claim 12 and in 
which the subsystem is a SYBASE system 
incorporated in the UNIX system. 

16. A method as claimed in claim 4 and in 
which the subsystem is the ORACLE system 
and the ORACLE instance view of its total 
CPU utilization and that of its individual 
sessions are measured, and discrepancies in 
the measurements of CPU utilization caused 
by sampling errors in the session level 
utilization are resolved by establishing 
relative reliability of measurements at the 
different levels of the system-wide CPU 
utilization measurement, UNIX process level 
CPU utilization measurement, the ORACLE 
measuring at the ORACLE instance level, 
and the ORACLE measurement for individual 
sessions, with the system-wide CPU 
utilization measurement being treated as 
more reliable than the UNIX process level 
CUP utilization measurement, which is 
treated as more reliable than the ORACLE 
instance measurement, and which, in turn, is 
treated as more reliable than the ORACLE 
measurement for individual sessions; and 
reconciling differences in data measured at 
the different levels; and analyzing the 
ORACLE instance data with the UNIX level 
performance data to compute reliable 
estimates of instance, session and workload 
utilizations. 

17. A method as claimed in claim 4 and in 
which the subsystem is the SYBASE system 
and the SYBASE instance view of its total 
CPU utilization and total CPU utilization for 
individual SYBASE users and that of its 
individual sessions are measured, and 
discrepancies in the measurements of CPU 
utilization caused by sampling errors in the 
session level utilization are resolved by 
establishing relative reliability of 
measurements at the different levels of the 
system-wide CPU utilization measurement, 
the UNIX process level CPU utilization 
measurement, the SYBASE CPU utilization 
measurement, by the SYBASE instance 



http://www.google.com/patents?vid=USPAT5761091 



l/l 2/2007 



Method and system for reducing the errors in... - Google Patents 



Page 4 of 5 



level, and the SYBASE measurements for 
user CPU utilization and for individual 
sessions, with the system-wide CPU 
utilization measurement being treated as 
more reliable than the UNIX process level 
CPU utilization measurement, which is 
treated as more reliable than the SYBASE 
instance measurement, and, which, in turn, is 
treated as more reliable than the SYBASE 
measurement of user CPU utilization which is 
more reliable than the SYBASE individual 
sessions measurements; and reconciling 
differences in data measured at the different 
levels; and analyzing the SYBASE instance 
data with the UNIX level performance data to 
compute reliable estimates of instance, 
session and workload utilizations. 

18. A method of reducing the errors in the 
measurement and computing of resource 
usage by a series of levels of computer 
system processes and by the workload and 
sessions level process-implemented 
subsystems, that comprises, measuring 
resource usage data at different levels of the 
system and subsystem; establishing a 
relative reliability order of the measurements 
at the different levels; establishing the affinity 
or strength of the relationship between any 
set of levels of measurement; and 
correspondingly correcting measurements at 
each level. 

19. A method as claimed in claim 18 and in 
which usage activity data is collected at all 
levels, with the sum of activity at one level 
becoming the initial approximation of activity 
at the next level; and reconciling data at 
successive levels in accordance with said 
establishing of relative reliability, thereby 
providing a unified way for dealing with data 
at different levels. 

20. A method as claimed in claim 19 and in 
which the measurements are of CPU 
utilization of the system, the system is UNIX, 
and the subsystem is one of ORACLE and 
SYBASE, and in which the reconciling of 
differences in data measured at the 
respective different levels involves analyzing 
the respective instance level data with the 
UNIX level performance data to compute 
reliable estimates of instance, session and 
workload level utilizations. 
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